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Abstract 

We analyze the classic board game of Mastermind with n holes and a constant number 
of colors. A result of Chvatal (Combinatorica 3 (1983), 325-329) states that the codebreaker 
can find the secret code with 0(n/logn) questions. We show that this bound remains valid 
if the codebreaker may only store a constant number of guesses and answers. In addition to 
an intrinsic interest in this question, our result also disproves a conjecture of Droste, Jansen, 
and Wegener (Theory of Computing Systems 39 (2006), 525-544) on the memory- restricted 
black-box complexity of the OneMax function class. 

1 Introduction 

The original Mastermind game is a board game for two players invented in the seventies by 
Meirowitz. It has pegs of six different colors. The goal of the codebreaker, for brevity called 
Paul here, is to find a color combination made up by codemaker (called Carole in the following). 
He does so by guessing color combinations and receiving information on how close this guess is to 
Carole's secret code. Paul's aim is to use as few guesses as possible. 

For a more precise description, let us call the colors 1 to 6. Write [n] := {1, . . . , n} for any n E N. 
Carole's secret code is a length-4 string of colors, that is, a z G [6]^. In each iteration, Paul guesses 
a string x £ [6]^ and Carole replies with a pair {eq{z,x),7r{z,x)) of numbers. The first number, 
eq(z, x), which is usually indicated via black answer-pegs, is the number of positions, in which 
Paul's and Carole's string coincide. The other number, tt{z,x), usually indicated by white answer- 
pegs, is the number of additional pegs having the right color, but being in the wrong position. 
Formally eq{z,x) := \{i G [4] | = Xi}\ and tt{z,x) := maxpgs^ \{i £ [4] | Zi = Xp(j)}| - eq{z,x), 
where 5*4 denotes the set of all permutations of [4]. Paul "wins" the game if he guesses Carole's 
string, that is, if Carole's answer is (4,0). 

We are interested in strategies for Paul that guarantee him to find the secret code with few 
questions. We thus adopt a worst-case view with respect to Carole's secret code. This is equivalent 
to assuming that Carole may change her hidden string at any time as long as it remains consistent 
with all previous answers (devil's strategy). 

Previous Results. Mathematics and computer science literature produces a plethora of results 
on the Mastermind problem. For the original game with 6 colors and 4 positions, Knuth [Knu77] 
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showed that Paul needs at most four queries until being able to identify Carole's string (which he 
may query in the fifth iteration to win the game). 

Chvatal [Chv83] studies a general version of this game with k colors and n positions, that is, 
the secret code is a length-n string z G [A;]". Denote by d{n,k) the minimum number of guesses 
that enable Paul to win the game for any secret code. Chvatal proves that for k < n^~^, e > 
an arbitrarily small constant, we have d{n,k) = 0( io^n°^iogfc )- More precisely, he shows that for 

any e > and n sufficiently large, (2 + g) ^\^^n~\ogk S^esses chosen from [k]^ independently and 
uniformly at random, with high probability, suffice to distinguish between all possible codes (that 
is, each secret code leads to a different sequence of answers). Therefore, the secret code can be 
determined after that many guesses. This remains true if Carole replies only with black answer- 
pegs, that is, if for any of Paul's guesses x she reveals to him only eq(z,a;), the number of bits, in 
which her and Paul's string coincide. 

For larger values of /c, the following is known. For n < k < v?, Chvatal proves d{n,k) < 
2nlogk + 4n and for k = a;(n^logn) he shows {k — l)/n < d{n,k) < \k/n] + d{n,'n?). These 
results have subsequently been improved. Chen, Cunha, and Homer [CCH96] show that d{n, k) < 
2n[logn] + 2n+\k/n\+2 ioi k > n. Goodrich [Goo09] proves d{n,k) < n\logk~\ + [(2 — 1/A:)n] +k 
for arbitrary k. 

For k = 2 colors, the Mastermind problem is related to the well-studied coin weighing problem. 
For this reason, first results on this problem date back to years as early as 1963, when Erdos and 
Renyi [ER63] show that d{n,2) = e(n/logn). 

Concerning the computational complexity, Stuckman and Zhang [SZ06] show that it is A^P-hard 
to decide whether a given sequence (xW,(eq»,^»))*=i of queries x^^' and answers (eqW,7r») of 
black and white pegs has a secret code leading to these answers, i.e., whether there exists a string 
z E [A:]" such that eq{z,x^^^) = eq^'^ and tt{z,x^'^^) = tt^*) for all i E [t]. Goodrich [Goo09] proves 
that this is already A^P-hard if we only ask for consistence with the black answer-peg replies eq^*^ 

Our results. Originally motivated by a conjecture on black-box complexities (cf. Section 2), 
we study a memory-restricted version of the Mastermind problem. Since this original motivation 
asks for the case of two colors only, we restrict ourselves to the number k of colors being constant, 
though clearly our methods can also be used to analyze larger numbers of colors. 

The memory-restriction can be briefly described as follows. Given a memory of size m E N, 
Paul can store up to m guesses and Carole's corresponding replies. Based only on this information, 
Paul decides on his next guess. After receiving Carole's reply, based only on the content of the 
memory, the current guess, and the current answer, he decides which m out of the m + 1 strings 
and answers he keeps in the memory. Note that our memory restriction means that Paul truly 
has no other memory, in particular, no iteration counters, no experience that certain colors are not 
used, and so one. So formally Paul's strategy consists of a guessing strategy, which can be fully 
described by a mapping from m-sets of guesses and answers to strings x E [k]^, and a forgetting 
strategy, which maps {m + l)-sets of guesses and answers to m-subsets thereof. 

Clearly, a memory-restriction makes Paul's life not easier. The 0(n/ log n) strategies by Erdos 
and Renyi [ER63] and by Chvatal [Chv83] do use the full history of guesses and answers and thus 
only work with a memory of size 0(n/ log n). Surprisingly, this amount of memory is not necessary. 
In fact, one single memory cell is sufficient. 

Theorem 1. Let E N. For all n E N, Paul has a size-one memory strategy winning the Mas- 
termind game with k colors and n positions in 0{n/ \ogn) guesses. This remains true if we allow 
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Carole to play a devil's strategy and if Carole only reveals the number of fully correct pegs eq{x,z) 
("black answer-peg version of Mastermind"). 

The bound in Theorem 1 is asymptotically tight. A lower bound of ri(n/ log n) is already true 
without memory restrictions. This follows easily from an information theoretic argument, cf. [ER63] 
or [Chv83]. Our result disproves a conjecture of Droste, Jansen, and Wegener [DJW06], who 
believed that a lower bound of r2(n log n) should hold for the 2-color black answer-peg Mastermind 
problem with memory-restriction one. 

The proof of Theorem 1 is quite technical. For a clearer presentation of the ideas, we first 
consider the size-two memory-restricted model, cf. Section 3. The proof of Theorem 1 is given in 
Section 4. Before going into the proofs, in the following section we sketch the connection between 
Mastermind games and black-box complexities. 



2 Mastermind and Black-Box Complexities 

In this section, we describe the connection between the Mastermind game and black-box complexity. 
The reader only interested in the Mastermind result may skip this section without loss. 

Roughly speaking, the black-box complexity of a set of functions is the number of function 
evaluations needed to find the optimum of an unknown member from that set. Since problem- 
unspecific search heuristics such as randomized hill-climbers, evolutionary algorithms, simulated 
annealing etc. do optimize by repeatedly generating new search points and evaluating their objective 
values ("fitness"), the black-box complexity is a lower bound on the efficiency of such general- 
purpose heuristics [DJW06]. 

Black-Box Complexity. Let 5 be a finite set. A (randomized) algorithm following the scheme 
of Algorithm 1 is called black-box optimization algorithm for functions 5 — )• M. 

Algorithm 1: Scheme of a black-box algorithm for optimizing / : 5 ^ M 

1 Initialization: Sample x^^'^ according to some probability distribution p^^^ on S; 

2 Query /(x*^°)); 

3 for t = 1,2,3, .. . do 
Depending on (^{x^^\ /(x^*^))), . . . , {x^^~^\ f{x^^~^^))) choose a probability distribution 
p*-*) on S and sample x^*^ according to p*-*-* ; 
Query /(x^); 



For such an algorithm A and a function / : 5 — )• M, let T{A, /) G M U {oo} be the expected 
number of fitness evaluations until A queries for the first time some x G argmax/. We call 
T{A,f) the runtime of A for f. For a class J- of functions 5 — t- M, the A-black-box complexity 
of J- is T{A,J^) := supjgjr T(^, /), the worst-case runtime of A on J-". Let ^ be a class of 
black-box algorithms for functions 5 — t- M. Then the A-black-box complexity of J- is T{A,J-) := 
inf/ig^ T(y4, J^). If A is the class of all black-box algorithms, we also call T{A,J-) the unrestricted 
black-box complexity of J-. 

As said, the unrestricted black-box complexity is a lower bound for the efficiency of randomized 
search heuristics optimizing J^. Unfortunately, often this lower bound is not very useful. For exam- 
ple, Droste, Jansen, and Wegener [DJW06] observe that the A^P-complete MaxClique problem 
on graphs of n vertices has a black-box complexity of only 0{ii?). 
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Black-Box Algorithms with Bounded Memory. As a possible solution to this dilemma, 
Droste, Jansen, and Wegener suggest to restrict the class of algorithms considered from all black- 
box optimization algorithms to a reasonably large subset. A natural restriction is to forbid the 
algorithm to exploit the whole history of search points evaluated. This is motivated by the fact 
that many heuristics, e.g., evolutionary algorithms, only store a bounded size population of search 
points. Simple hill-climbers or the Metropolis algorithm even store only one single search point. 

Algorithm 2 is the scheme of a black-box algorithm with bounded memory of size //. It is 
important to note that a black-box algorithm with bounded memory is not allowed to access 
any other information than the one stored in the fi pairs . . . , /(rr^^))), which 

are currently stored in the memory and, in the selection step, also the information provided by 
{x^^^~^^\ f{x^^~^^^)). In particular, the algorithm does not have access to an iteration counter. 



Algorithm 2: Scheme of a black-box algorithm with memory of size /u for optimizing function 
f :S -^R 

1 Initialization: ^ 0; 

2 for t = 1,2, . . . do 

3 Depending (only) on A4 choose a probability distribution p on 5 and sample x^^^~^^^ 
according top; // variation step 

4 Query /(x^^^+i)); 

5 Select MCMU {(x^^^+i), of size \M\ < n; //selection step 



Mastermind and the OneMax function class. A test function often regarded to analyze 
how the randomized search heuristic under investigation progresses in easy parts of the search space, 
is the simple OneMax function OneMax : {0, 1}" R, x i-^ Ya=i ^ote that OneMax(x) = 
eq((l, . . . , 1), x) for all x G {0,1}". In fact, for any z G {0,1}", eq(z, •) yields an equivalent 
optimization problem. Let us denote by OneMax„ := {eq{z, ■) \ z £ {0, 1}"} the class of all these 
functions. 

Due to a coupon collector effect, many classical randomized search heuristics like randomized 
local search or the (fj, + A) evolutionary algorithm (with ^u, A constants) need ©(nlogn) function 
evaluations to optimize OneMax„. 

As a moments thought reveals, black-box algorithms optimizing OneMax„ correspond to strate- 
gies for Paul in the Mastermind game (without memory restriction) with two colors and only black 
answer-pegs used. Hence the unrestricted black-box complexity of OneMax„ is 0(n/logn) by the 
results of Erdos and Renyi [ER63] and Chvatal [Chv83] . 

This connection was seemingly overlooked so far in the randomized search heuristics community, 
where Droste, Jansen, and Wegener [DJW06] prove an upper bound of 0{n) and later Anil and 
Wiegand [AW09] prove the asymptotically correct bound of 0(n/ log n). Since already the first 
bound is lower than what many randomized search heuristics achieve, Droste, Jansen, and Wegener 
suggest to investigate the memory-restricted black-box complexity of OneMax^. They conjecture 
in [DJW06, Section 6] that a memory restriction of size one leads to a black-box complexity of 
order O(nlogn). 

Again, clearly, the memory-restricted black-box complexity of OneMax„ and optimal strategies 
for Mastermind with two colors, black answer-pegs only, and a corresponding memory restriction 
are equivalent questions. Consequently, our result can be rephrased to saying that the black-box 
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complexity of OneMax,„ even with the memory restricted to one is 0(n/logn), disproving the 
conjecture of Droste, Jansen, and Wegener. 

3 The Mastermind Game with Memory of Size Two 

Since the proof of Theorem 1 is quite technical, we give in this section a simpler proof showing that 
with a memory of size two Paul can win the game using only 0(n/ log n) guesses. Already this 
proof contains many ingredients needed to prove Theorem 1, e.g., the use of the random guessing 
strategy with limited memory, the block-wise determination of the secret code, and the simulation 
of iteration counters in the memory. 

Let A; > 2 be the number of colors used. In particular for A; = 2, it will be convenient to label 
the colors from to A; — 1. Let us denote the set of colors by C := [0..k — 1] := {0,1, . . . ,k — 1}. 
We assume that A; is a constant and that the number n of positions in the string is large, that is, 
all asymptotic notation is with respect to n. 

Theorem 2. Paul has a size-two memory strategy winning the black answer-peg only Mastermind 
game with k colors and n positions in 0{n/ logn) guesses. This remains true if we allow Carole to 
play a devil's strategy. 

As many previous works, the proof of Theorem 2 heavily relies on random guessing. For the 
case of A: = 2 colors, already Erdos and Renyi [ER63] showed that there is a t G G(n/logn) such 
that t guesses x^^\ . . . chosen from {0, 1}" independently and uniformly at random, together 
with Carole's black answer-peg answers, uniquely define the hidden code. This was generalized by 
Chvatal [Chv83] to the following result. 

Theorem 3 (from [Chv83]). Let e > 0, let n > n{e) be sufficiently large and let k < n^~^ . 
Let x^^\ . . . ,x^^'^ be t > (2 + g) ^\^^^l\^gk samples chosen from C" independently and uniformly at 
random. Then for all z £ C" , the set 

^consistent ._ ^y^Qu^y-^ . eq(y,a;«) = eq{z,x^'^)} 

satisfies E[|5'=''''"*"*^"*|] < 1 + 1/n. 

Since the strategy implicit in Theorem 3 needs a memory of size 6 (n/ logn), we cannot apply 
it directly in our setting. We can, however, adapt it to work on smaller portions ("blocks") of the 
secret code, and this with much less memory. 

Let ?/ G C" and let B C [n] be a block (i.e., an interval) of size s := \\/n\. As we shall see, by 
t G 0(s/logs) times guessing a string obtained from y by replacing the colors in B by randomly 
chosen ones (and guessing k additional reference strings), we can determine the part of the 
secret code z in block B. 

We can do so with a memory of size two only. We store the string obtained from y by altering 
it on B (sampling string) in one cell. Note that we do not need to remember y, as we only need to 
ensure that our guesses agree in the positions [n] \ B. We use the other memory cell (storage string, 
in the following typically denoted by x) to store the random substrings of length s substituted 
into y at B, and Carole's answers. Note that each such answer can be encoded in binary using 
in G 0(log n) entries of the string. Hence the t guesses and answers can be memorized using a total 
number of t(s + in) = 0(n/ logn) positions. 
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Algorithm 3: An almost size-two memory-restricted algorithm winning the fc-color black 
answer-peg only Mastermind game in 0{n/ logn) guesses. Remark: x denotes the unique 
string in M with = 1 and y denotes the unique string in M with y„ = 0. 

1 Initialization: y [0 ... 0]; 

2 Query eq{z,y) and update ^ {{y,^(i{z,y))}; 
for i = 1 to \{n — l)/s] do 

X -(r- [0 . . . 0|1]; //initialization of x 
Query eq(z, x) and update AJ by adding (i 
for g = Otot + A: — Ido 

if q < k then y ^ substitute(y, Bi, [q ■ ■ ■ q]) ', / /reference string 
else y ^ substitute(y, r) where r € C''^'' u.a.r.; //random guess 
Query eq{z,y) and update Ai by replacing (y,eq(z,y)); 



3 
4 
5 
6 
7 
8 
9 
10 
11 

12 
13 
14 



1) or replacing (i > 1) {x,eq{z,x)) in Al; 



,,(^.)|BLOCK,(?/)|binary^Jeq(z,y))|l|0...0|l] ; //add y's info to 



Query eq(2;, x) and update ^A by replacing {x,eq{z,x)); 

while Ai(y) < \Bi\ do 

y ^ substitute(y, Sj, it;), where w G ^consistent ^_g^_j. . 
Query eq(z,y) and update AJ by replacing {y,eq{z,y)); 



15 while eq{z,y) < n do y substitute(y, {n}, c), where c G C u.a.r., and query eq{z,y); 



This approach allows us to determine s positions of z using t = 0(s/logs) guesses. Hence we 
can determine the secret code z with t[n/s] = 0(n/ log n) guesses as desired. 

In Algorithm 3 (notation used will be introduced below) we make this strategy more precise 
by giving it in pseudo-code. Note, however, that this algorithm does not fully satisfy the size-two 
memory restriction. The reason is that the queries do not only depend on the current state of the 
memory, but also on iteration counters and, e.g. in lines 9 and 11, on the program counter. Further 
below, in Algorithm 4 we shall remove this shortcoming with a few additional technicalities, which 
we are happy to spare for the moment. 

Before we argue for the correctness of Algorithm 3, let us fix the notation. For any string 
X E C" we also write x = [xi...x„]. To ease reading, we allow ourselves to indicate different 
structural components of x by vertical bars, e.g., x = [xi . . . Xp\xp+i . . . Xn]- For i £ [\{n — l)/s]] 
let Bi := {{i — l)s -|- 1, . . . , is} n [n — 1], the positions of the i-th block. Set 

BLOCKi(x) := X|b, := [x(j_i)^+i . . . x^in{is,n-i}] , 

the i-th block of x. For any string r € C'^'' we define 

substitute(x, Bi, r) := [xi . . . |r |xmin{is,n-i}+i ■■■Xn], 

the string with the i-th block substituted by r. Similarly, let substitute(y, {n}, c) := [yi . . . yn_i|c]. 
Note that we do not assign the n-th position to any of the blocks. We do so because in Algorithms 3 
and 4 we shall use that position to indicate, which one of the two strings in the memory M is the 
storage string (the unique x £ Ai with x„ = 1) and which one is the sampling string (the unique 
string y £ Ai with y„ = 0). 

Let pi{x) := max{i € [n — 1] | Xj = 1}, the largest position i < n of x with entry "1". As 
mentioned above, we encode Carole's answers eq(z,y) G [0..n] in binary, using := [log?i] -|- 
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1 positions, and we denote this binary encoding of length in by binary^^(eq(2;, y)). By Aj(y) 
we denote the contribution of the i-th. block to the value eq{z,y), i.e., eq{z,y) is the number of 
positions in the i-th block, in which Paul's guess y and Carole's secret code z coincide. Formally, 
Ai{y) := eq{z\Bi,y\B^)- Lastly, let ^consistent strings w of length \Bi\ such that 

suhst itute{z, Bi,w) is consistent with all of Carole's replies (formal definition follows). We shall 
see below that both Aj(y) and 5?°°sistont -^^ computed solely from the content of the memory 
cells (lines 12-14). 

We now argue for the correctness of Algorithm 3. Let us consider the state of the memory after 
having sampled all t random samples for the i-th block (that is, we are in lines 12-14). We show 
that based on the information given in the memory, we can restore the full history of guesses for the 
i-th block. To this end, first note that for any guess y done in line 9, we used s + + 1 positions in 
X for storing its information (line 10; we add the additional "1" at the end to ease determining via 
Pi{x) the positions in x, which have not yet been used for storing information). In lines 6-11 we 
first asked and stored k non-random guesses x'^ = substitute(y, Bi, [c. . . c]) and we stored these 
reference strings together with Carole's replies eq{z,x'^) = J2h=i'^'^~^^c{s+er,+i)-hj ^ ^ [Q..k — 1]. 
Therefore, for j e [t], the j-th random sample is r^^) = . . . 

and the corresponding query was y^^^ = substitute(y, Sj, r'^-'^). We have stored Carole's reply to 
this guess in binary, and we can infer eq{z,y^^^) = Yl^h=i ^{k+j){s+i„+i)-h- This shows how to 
regain the full guessing history. 

Next we show how to compute the contributions Aj (?/(•')) of the entries in the i-th block. To this 
end, note that the constant substrings [c . . . c] in the reference strings x'^ in total contribute exactly 
\Bi\ to the sum eq(z, x^) + . . . + eq{z, x^). Formally, Y!IzI eq([z^i_i)s+i ■ ■ ■ ^min{js,n-i}], [c . . . c]) = 
\Bi\. Since all other positions of the sampling string y are not changed during the phase, in which 
we determine the i-th block, we infer that 

Ai(y(j')) = eq(z,y(^')) - (eq(z,xO) + . . . + eq(z, x^) - \Bi\)/k. 

Consequently, in lines 12-14, the algorithm can compute Aj(y(-'^) for all j G \t\. From this it can 
infer 

^consistent ._ |~ ^ (.\B,\ | y- ^ . eq(5, BLOCKi (y(^') )) = Ai(y(j'))}, 

the set of possible code segments in B^. By Theorem 3, the expected size of ^consistent -g bounded 
from above by l + l/|i?j|. Thus, in lines 12-14 we need an expected number of l + l/|i?j| samples w 
chosen from ^jconsistent uniformly at random until we find ay = substitute(?/, Bi, w) with Aj(?/) = s 
(which implies that the i-th block of y coincides with Carole's secret code). This shows how we 
determine the entries of the i-th block in an expected total number of t = 0{s/ log s) guesses. 

When Algorithm 3 executes line 15, all but the last entry of y coincide with Carole's secret 
code. Hence trying random colors in the n-th position finds the hidden code z with an additional 
expected number of A; = 6(1) guesses. 

To turn Algorithm 3 into a size-two memory-restricted one, we use the first in entries of x to 
store in binary the iteration counter i, which indicates the index of the block currently being under 
consideration. This will move the storage space for the guesses and answers by positions to the 
right. Formally, we define i{x) := Ylh'=o '^^^tn-h- The inner for loop needs no additional memory 
to be simulated, because we can learn from pi{x) how many guesses q{x) have been queried already. 
More precisely, since storing each guess requires s+^^+l positions and the first in positions are used 
for indicating the number of already determined entries, we have q{x) := (^1(2;) — in)/{s + + 1) • 
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Algorithm 4: A size-two memory-restricted algorithm winning the /c-color black answer-peg 
only Mastermind game in 0(n/ log n) guesses. Remark: x denotes the unique string in Ai 
with = 1 and y denotes the unique string in Ai with yn = 0. 



' ; // clear memory 



1 Initialization: Let M 

2 if = then 

3 y [0...0] ; //first reference string 

4 Query eq{z,y) and update Al ^ {{y,eq{z,y))}; 

5 else if \M\ = 1 then 

6 x [0...0|1] ; //initialization of storage string 

7 Query eq{z,x) and update Af ^ Ai U {{x,eq{z, x))}; 

8 else if i{x) < \{n — l)/s] then 

9 
10 

11 

12 
13 
14 
15 

16 
17 
18 

19 
20 

21 
22 



if X = [0 . . . 0|1] or A,(,)(y) = then 

x^ [binary^^(i(x) -M)|BL0CKj(3,)+i(y)|binary^^(eq(2:,y))|l|0. . . 0|1] ; //clear storage 
string and add first reference string 
Query eq{z,x) and update Ai by replacing {x,eq{z, x)); 

else if Part{y, x) = 1 and q{x) < t + k then 

if q{x) < k then y ^ substitute(y, -Bj(2,.), [q{x) ■ ■ ■ q{x)]) ; //reference string 
else y ^ substitute(y, Sj^^.) , r) where r G C'^'(^)' u.a.r.; //random guess 
Query eq(2:, y) and update Ai by replacing {y,eq{z,y)); 

else if Part{y,x) = and Aj(^)(?/) < then 

X ^[xi. . .a;p^(^)|BL0CKi(2.)(y)|binary^^(eq(2;,y))|l|0. . . 0|1]; //add y's info to x 
Query eq(z, x) and update Ai by replacing {x,eq{z, x)); 

else if Part{y, x) = 1 and q{x) = t + k then 

y ^ substitute(y, t/;) where w G ^^consistcnt (^j^ggen u.a.r.; 

Query eq{z,y); 

if ^i(x){y) = \Bi(x)\ then Update M by replacing {y,eq{z,y)); 



23 else if i{x) = \{n — l)/s] then 

24 y -^r- substitute(y, {n}, c) where c € C\{y„,} u.a.r.; 

25 |_ Query eq(z, y) ; 

26 Go to line 2; 



Lastly, we need to replace the sequential queries in lines 9 and 11 of Algorithm 3 (as this exploits 
information stored in the program counter). Fortunately, again we can deduce from the memory 
where we stand. We define a function Part(y,x), which equals 1 if the information of y has been 
added to the storage string x already and which equals otherwise. That is, we set 

Part(y x) = I^i=l ^'"^^Pi W-^ = ^^(^' ^) ^^"^ ^^^^^^{x)iy) = [Xpi{x)-e„-\B,^^^\ ■ ■ ■ Xp,(a:)-e^~l] 

lo, otherwise. 

Note that Part(y,x) = 1 indicates that the information of y has been stored in x also in the case 
that our current sample equals the previous one. This is no problem as then the current guess 
does not give any new information. Hence the use of Part modifies the algorithm to sample t 



8 



random guesses without immediate repitition. Note that the probabihty to sample the same string 
r G c'^'f"")' twice in a row is at most 1/2 (if the last block consists only of one position and k = 2) 
and is typically much smaller. Hence, occurrences of this event have no influence on the asymptotic 
number of guesses needed to win the game. 

With these modifications, Algorithm 3 becomes the truly size-two memory-restricted Algo- 
rithm 4. 

4 Memory of Size One: Proof of Theorem 1 

Compared to the situation in Section 3, Paul faces two additional challenges in the size-one memory- 
restricted setting. The obvious one is that he has less memory available, in particular, after a large 
part of the code has been determined and needs to be stored. The more subtle one is that he cannot 
any longer query a search point and then store whatever is worth storing in the second memory 
cell. With one memory cell, all he can do is to guess a new string and keep or forget it. 

4.1 Linear Time Strategies 

Before we give a proof of Theorem 1, let us discuss a linear time winning strategy, i.e., a strategy 
that allows Paul to find Carole's secret code in a linear expected number of guesses using one 
memory cell only. This linear time strategy will be used in the proof of Theorem 1 to determine 
the last Q{n/ \ogn) entries of the secret code. 

The basic idea of the linear time strategy is to test each position one by one, from left to right. 
Since we have just one memory cell, we need to indicate in this one string, which entries have 
been determined already. We do so by keeping all not yet determined entries at one identical value 
different from the one of the entry determined last. To this end, let us for all x E define 

tn{x) := min{i G [n] | Vj G {«,..., n} : Xj = Xi} , 

the tail number of x. The following lemma describes the linear time strategy. 

Lemma 4. Let x £ C^. Furthermore, let us denote Carole's secret code by z £ C^. Let us assume 
that the firsttn{x) — l entries of z have been determined (i.e., Carole can no longer change the entries 
of [zi . . . 2tn(a;)-i]J- Further assume that xi = Zi for all i < tn(x) and that M = {{x , eq{z , x))} is 
the current content of the memory cell. 

There is a size-one memory-restricted guessing procedure LinAlg that — even if Carole plays a 
devil's strategy — after an expected constant number of successive calls modifies the memory such 
that the string y now in the memory satisfies yi = Zi for all i < tn(x) and tn(y) = tn{x) + 1. Every 
call of LinAlg requires only one guess. 

Interestingly, for the definition of LinAlg, we need to distinguish between the cases of /c = 2 and 
A; > 3 colors, as certain arguments exploit particular properties of these cases. For k = 2 colors, we 
have analyzed this strategy already in [DWll]. 

4.1.1 The case oi k = 2 colors C = {0, 1} 

In this section we prove that for k = 2 colors C = {0, 1}, LinAlg is a procedure that requires, in 
expectation, three calls to modify the memory content by replacing the current string x that is 
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assumed to satisfy the conditions of Lemma 4, by a string y with tn(?/) = tn(x) + 1 and yi = Zi for 
all i < tn(x). 

For all i € [n] let e" be the i-th unit vector of length n. 
Algorithm 5: Routine LinAlg for k = 2 colors 

1 Assumption: The string x G {0, 1}" in the memory satisfies tn(x) < n and Xj = Zi for all 
i < tn(x); 

2 Sample y e {x ® e^^(^^yX © Y^^=tn{x)+i ^i'} uniformly at random; 

3 Query eq{z,y); 

4 if y = X® e"j^(^) then 

5 I ii eq{z,y) > eq{z,x) then M ^ {{y,eq{z,y))}; 

6 else 

7 1^ if eq(2:, x) + eq{z, y) = n + tn{x) then A4 <^ {{y, eq{z, y))}; 



Proposition 5 (from [DWll]). For k = 2 colors, Algorithm 5 satisfies the conditions of Lemma 4- 
In expectation, three calls to routine LinAlg suffice. 

Proof. Let x € {0, 1}" be a bit string with tn(x) < n and Xj = Zi for all i < tn(x). 

Algorithm 5 samples with probability 1/2 the string y = x © ^tn{x)^ ^^'^ with probability 1/2 it 
samples y = x © X]r=tn(a:)+i That is, either it flips only the tn(x)-th bit of x or it flips all "tail 
bits" but the tail numbered one. 

If y = X © e^^(^), clearly we have Ztn(x) = ytn{x) if and only if eq(2;, y) > eq{z, x). 

Therefore, let us assume that Algorithm 5 samples y = x (B l^"=tn(x)+i ^7- show that 
Ztn{x) = ytn{x)i= Xtn{x)) holds if and only if eq(z,x) + eq(2;,y) = n + tn(x). By definition we have 
yi = Xi = Zi for all i < tn(x). Thus, the first tn(x) — 1 bits of x and y contribute 2(tn(x) — 1) to 
the sum eq(z, x) + eq{z, y); formally, 

eq([zi . . . Ztn{x)-i], [xi... xtn(x)-i]) + eq([2;i • • • Ztn{x)-i], [yi • • • ytn{x)-i]) = 2(tn(x) - 1) . 

On the other hand, for all i > tn(x) either have = Xj or = 1 — Xj = yj. Thus, the last last 
n — tn(x) bits of x and y contribute exactly n — tn(x) to the sum eq{z, x) + eq{z, y); formally, 

eq([^;tn(x)+l ■■■Zn], [Xtn(x)+1 ■ ■ ■ Xn]) + eq{[ztn{x)+l • • • ^n], [ytn{x)+l ■ ■ ■ yn]) = n - tn(x) . 

By definition we also have ytn(x) = Xtn(x) and, thus, 
eq(z,x) + eq(z,y) 

= eq([2:i . . . Ztn(x)-l], [Xl... Xtn(x)-l]) + eq{ztn(x) , Xtn{x)) + eqi[ztn(x)+l ■ ■ ■ Zn], [Xtn{x)+l ■ ■ ■ Xn]) 
+ eq([zi . . .2tn(x)-l], [yi ■ ■■ytn{x)-l]) +(i^{Ztn(x)iXtn{x)) + eq([2;tn(x)+l • • - ^n], [ytn(x)+l ■■■yn]) 

= 2(tn(x) - 1) + n-tn(x) + 2eq(ztn(x), 3;tn(x)) 
= re + tn(x) + 2 eq(ztn(x) , 2;tn(x) ) - 2 

This shows that eq(z, x) + eq(2;, y) = n + tn(x) if and only if eq(ztn(x)) 3;tn(x)) = 1) i-^-j if and only 

if 2:tn(a;) = a;tn(x) (= ytn(x))- 
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It is immediate that for a secret code z taken from {0, l}" miiformly at random, the probabihty 
to obtain, in one call of LinAlg, a string y with tn(y) = tn(a;) + 1 and yi = Zi for all i < tn{y) 
is 1/2. This shows that, if Carole does not play a devil's strategy and if her string is taken from 
{0, 1}" uniformly at random, we need, on average, two successive calls to procedure LinAlg until 
we obtain a string y as desired. 

Proposition 5 follows from the easy observation that it takes, on average, three iterations until 
both y = X (B and y = x (B ^^=tn{x)+i have been sampled. That is, even if Carole plays a 

devil's strategy, three calls of Algorithm 5 force her to accept one entry Ztn(a;) £ {Oj !}• ^ 

4.1.2 The case of A; > 3 colors C = [0..k - 1] 

The main argument of Proposition 5, namely that Ylc=o ^^i^-: [c . • c]) = n, seems hard to extend 
to more than two colors with no additional memory. However, having more than two colors can be 
exploited in a different way as it gives more than one way to mark the tail [2;tn(x) • • • ^n] of a search 
point X. 

Proposition 6. For k>3 colors, Algorithm 6 satisfies the claims of Lemma 4- 



Algorithm 6: Routine LinAlg for A; > 3 colors 

1 Assumption: The string x G in the memory satisfies tn(a;) < n and Xj = Zi for all 

i < tn(x); 

2 With probability {k - l)/k sample y G {[xi . . . Xtn{a;)-i|jktn(x)+i ■■■Xn] \ j & C\{xtn(x)}} 
uniformly at random and with probability 1/k sample 

y G {[xi . . . Xtn(x)-i|j • • • j] lie C\{xtn(x)-i}} Uniformly at random; 

3 Query eq{z,y); 

4 if y — [xi . . . Xi^(^x)~i\j\xtn{x)+i ■ ■ ■ Xn]j j 7^ ^^^^(x) then 

5 I if eq{z,y) > eq{z,x) then M ^ {{y,eq{z,y))}; //zt,^(^^) = j 

6 else 

7 \_ M ^ {{y,eq{z,y))}; 



Proof. Let x G C" with tn(x) < n and Xj = Zi for all i < tn(x). If y = 
[xi . . .Xtn(x-)-ib'ktn(x)+i • • -a;™], then clearly we have eq{z,y) > eq(z,x) if and only if ytn(x) = 
j = 2:tn(x')- Therefore, all we need to show is that, using the strategy of Algorithm 6, it takes a 
constant number of guesses until for each j £ C there exists an ij G C\{j} such that we have queried 
both X = [zi . . . 2tn(a;)-ilb' • • • V ~ [^i ■ ■ ■ •2^tn(x)-i I J • • • b'] ™ Subsequent guesses. This 

follows essentially from the fact that k is constant. 

More precisely — regardless of the current search point x — for any bitstring y = 
[zi . . . ^tn(a;)-iblb' • • • probability to sample y in the second of two subsequent calls to Algo- 

rithm 6 is constant. Therefore, the expected number of calls to Algorithm 6 until y is sampled is 
constant. The claim follows by the linearity of expectation. □ 

4.2 Proof of Theorem 1 

Building on LinAlg and the block-wise random guessing strategy introduced in Section 3, we can 
now present Paul's winning strategy for the single memory cell setting, which proves Theorem 1. 
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Proof of Theorem 1. The structure of this proof is as fohows. First we sketch the main ideas and 
give a high-level pseudo-code for the size-one memory-restricted strategy winning the black answer- 
peg only Mastermind game with k colors in 0(n/log?7-) guesses. After fixing some notation, we 
then present more details for the different phases, in particular for the random guessing phase, 
which is the most critical part of this proof. We present here the details of Paul's strategy for the 
case of A; = 2 colors. The generalization to A; > 3 colors is pretty much straightforward. Some 
remarks on the differences between the case of /c = 2 and A; > 3 colors are given at the end of this 
proof. 

Let us begin with the rough overview over Paul's strategy. He determines the first n—Q{n/ \ogn) 
positions using random guessing, where he manages to store the random substrings and Carole's 
answers in the yet undetermined part of his one string in the memory. As in the proof of Theorem 2, 
he does so by iteratively determining blocks of length s := [y^. Then, using the linear time 
strategy from Lemma 4, he determines the missing 0(n/logn) entries in 0(n/logn) guesses. 

To distinguish between the sampling and the linear time phase, Paul uses the last two entries 
suff ix(x) := [xn-iXn] of his string x. He has suff ix(a;) = [01], when he is in the random guessing 
phase, and he uses suffix(x) = [cc] for some c € C to indicate that he applies calls to LinAlg. 
Once Paul has determined all but the last two entries (visible from tn{x) = n — 1), he simply needs 
to sample uniformly at random from the set of all A;^ — 1 remaining possible strings. This clearly 
determines z in a constant expected number of additional queries (phase 3). 

The total expected number of guesses can be bounded by 

number of blocks de- queries needed to de- queries needed queries needed 
termined in phase 1 termine any such block in phase 2 in phase 3 

'^(i-e(iog-^n)) 0(2? + 0(5? + m =0(^). 

A non-trivial part is the random guessing phase. As in the proof of Theorem 2, after guessing 
t + k strings, we want to be able to regain the full guessing history. If we simply stored the random 
substring and Carole's reply in some unused part of x, then this changed memory would influence 
Carole's next answer and we would be unable to deduce information on the next guess from it. We 
solve this difficulty as follows. We store Carole's latest reply (i.e., value eq(z,x) currently in the 
memory) and we sample new (random) substrings for the current block at the same time. Here we 
store the value eq(z, x) in a part of x, for which we know the entries of Carole's hidden code. By 
this, we can separate in Carole's next answer the influence of the just stored information from the 
one of the random guess. The precise description of this Sampling strategy is presented below. 

To gain the storage space, for which we know the hidden code, we need to add another phase, 
phase 0, in which we apply O(logn) calls to the LinAlg procedure until we have determined the 
first ^ := In + ^ positions of z (cf. Lemma 4). This does not change the overall asymptotic number 
of queries Paul needs to win the game. 

The pseudo-code for this size-one memory-restricted strategy is given in Algorithm 7. Similar 
to the notation in the proof of Theorem 2, we denote for any h G [0..n] its binary encoding of length 
In by binary^ (/i) and for h G [0..s] we denote its binary encoding of length Is := [logs] -|- 1 by 
binary^^ (/i). The current block of interest i{x) is encoded in positions {n — — 1, . . . , n — 2}, i.e., 

we have i{x) := Ylh=o B^^^j.-^ := {I + {i{x) — l)s -|- 1, . . . , ^ -|- i{x)s}, and, consequently, 

BL0CKi(^.)(2;) : = 

[^i+{i{x)-i)s+i ■ ■ ■ xiJ^i(^x)s\- The total number of blocks, which we determine via 
random guessing, is h := [^^y^(1 — i^5^)J for some suitable large constant K. The number of 
random guesses for each block is t := \(2 + e) ^]a^^-ioJk ~\ '^^ig^'g e > is an arbitrarily small 
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constant. Lastly, the actual number of already sampled guesses for block B^f^^-^ is denoted by q{x). 
As in the proof of Theorem 2, q{x) can be computed via := max{i G [n — is — S] | = 1} , the 

largest position i < n — 2 — is with entry Xi = 1. Details how q{x) can be computed are given in the 
description of the OptimizeBlock routine, which, after t random samples have been sampled via 
the Sampling routine, determines BL0CKj(j.)(2;), stores it in -Bj(2,.) and increases the block counter 
i{x) by one. 

Algorithm 7: A size-one memory-restricted algorithm winning the A;-color black answer-peg 
only Mastermind game in 0(n/ log n) guesses. 

1 Initialization: Let 7W ^ 0; 

2 if 7W = then 

3 x ^ [c . . . c] for some c £ C chosen u.a.r.; 

4 Query eq{z,x) and update M. ^ {{x,eq{z, x))}; 

5 if 3c G C : suffix{x) = [cc] A tn(x) < i then 

6 1^ LinAlg ; / /find the first £ entries [2:1 .. . Zi] 

7 else if 3c G C : suffix{x) = [cc] A tn(x) =1+1 then 
X [^_^^^^^\^j_^^\xi . . . X(\ p . . . jbinary^^ (1)|01]; //copy prefix (which 

i hs n-{2l+bs+ls+2) 

coincides with the hidden code) 

Query eq(z,x) and update Ai by replacing (x, eq(z, x)); 

10 else if suffix{x) = [01] A i(x) < 6 A q{x) < t + k then 

11 1^ Apply Sampling; 

12 else if suffix{x) = [01] A i(x) < 6 A q{x) = t + k then 

13 1^ Apply OptimizeBlock; 

14 else if suffix{x) = [01] A i(x) = b+l then 

15 X ^ [xe+bs+s+1 ■ ■ ■ X2e+bs+s+i\xe+i . . . X£+bs|c . . . c] with c G C\{xi+bs} u.a.r.; 

16 Query eq(z,x) and update 7W by replacing (x, eq(2;, x)); //prepares x for LinAlg 

17 else if 3c G C : suffix{x) = [cc] A £ + bs < tn(x) < n — 2 then 

18 1^ LinAlg; 

19 else if 3c G C : suffix{x) = [cc] A tn(x) = n — 1 then 

20 Sample y G {[xi . . . x„_2|p] | p G C^}\{x} uniformly at random; 

21 Query eq(z,y); 

22 if eq(2;, y) = n then Ai ^ {{y, eq{z, y))} ; / /secret code found 

23 Go to line 2; 



Let us now present a more detailed description of Algorithm 7. 

As in the proof of Theorem 2 let us assume that Carole has chosen a fix code z G C", which she 
does not change during the game, i.e., to any of Paul's guesses x she replies eq(z,x). By adopting 
a worst-case view below, we implicitly still allow Carole to change z as long as the new choice is 
consistent with all previous replies. 

If in any iteration we find an x with eq(z, x) = n, we have x = z and all we need to do is to 
output X. Thus, in what follows we always assume eq(2;, x) < n. 
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Initialization of Algorithm 7, lines 1—4. For initialization, Paul picks a c £ C uniformly 
at random and guesses the all-"c"s string of length n, x = [c...c]. He updates the memory 
M ^ {{x , eq{z , x))} accordingly. This memory satisfies all conditions of line 1 of routine LinAlg 
(Algorithms 5 and 6) with tn(x) = 1. 

Phase of Algorithm 7, lines 5—6. To this string, Paul applies successive calls to the 
routine LinAlg. By Lemma 4 he finds a string y E C" with yi = Zi, i < i, and tn(y) = ^ + 1 in 
an expected number of 0{i) guesses. As mentioned above, Paul runs this first phase until he has 
determined the first 

£= [logn] + 2 = 4 + 1 

entries. This is the number 

in = \logn] + 1 

of entries needed to store in binary any integer value h G [0..n] plus 1. These bits shall be used 
in phase 1 of Algorithm 7 to indicate the status of the Sampling routine (first position) and for 
storing Carole's latest reply eq{z,x) G [0..n] (positions {2, . . . ,£})■ We shall describe this in more 
detail below. 

Intermediate step, lines 7—9. After Paul has determined the first i entries, he needs to 
prepare the string for the random guessing phase, which is the main part of Algorithm 7. Since 
we want to use the first i entries to store reference values, we need to make a copy of the prefix 
(which, by construction, coincides with Carole's hidden code). To this end, we query in line 7 the 
string 

y=[ \xi...xe\ P . . . Q |binary^^(l)|01], 

e+bs entries n-{2e+bs+es+2) entries £^ entries 

where x is the string that is currently in the memory, i.e., the string we obtained through phase 0^ 
and bs = n — G(n/logn) is the number of positions Paul determines using random guessing. As 
mentioned in the overview, the last two entries suff ix(a;) = = [01] indicate that we are 

entering the second phase. Note that throughout the game we have 

( [01], if we are in the second phase of the algorithm 
suffix(x) = < 

I [cc], for some c G C, otherwise. 

In positions {n — — 1, . . . , n — 2} we indicate in binary the block, which we are currently trying 
to determine. That is, whenever x is the current string in the memory with suff ix(a;) = [01], then 
the block currently of interest is -Bj(x) with i{x) = ^^=0^ 2^Xn-2-i ■ Here in this intermediate step 
we initialize i{x) = 1. 

After guessing y and updating the memory by replacing the current one with {{y,eq{z,y))}, 
Paul enters the first phase (as indicated by suff ix(a;)). The overall expected number of queries 
needed until this point is 0{i) = O(logn). 

First phase of Algorithm 7, lines 10—13. The first phase is the main phase of Algorithm 7. 
In this phase, Paul determines all but n — Q{n/ logn) entries by iteratively determining blocks of 
length s = \\/n~\ via random guessing. In total, he determines b = [^(l — j^^)J such blocks in 
this phase. The description of the routines Sampling (in which k reference strings and t random 
samples are queried for the i(x)-th block Bii^^-^) and OptimizeBlock (in which we use the reference 

^That is, we have tn(2;) = i + < t : Xi = Zi, and 3c £ C\{xiyii > tn{x) : Xi = c. 
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strings and the random guesses to determine BL0CKj(^)(2;), the z(x)-th block of the secret code z) 
is quite technical. We present the details after the description of the remaining phases. 

Second phase of Algorithm 7, lines 14—18. In the second phase of Algorithm 7 we again 
apply successive calls to routine LinAlg to determine all but the last two remaining entries. To 
this end, we first need to prepare the string. This is done in lines 14-16 of Algorithm 7. It follows 
from the correctness of the first phase that the string x queried in line 16 satisfies Xi = Zi for all 
1 < i < i + bs. And, by definition, it also satisfies / Xtn{x) with tn(x) = £ + bs + 1. 

Prom Lemma 4 we infer that via routine LinAlg we find a string x with tn(rE) = n — 1 and 
Xi = Zi for all 1 < i < ?i — 2 in an expected number of 0{n — 2 — [l + bs)) = 0(n/ log n) queries. 
These are lines 17 and 18 of Algorithm 7. 

Third phase of Algorithm 7, lines 19—22. Comparable to the last step of Algorithm 4, 
all we need to do in the last phase of Algorithm 7 is to determine the last two entries. This is 
done by sampling y uniformly at random from the set of possible target strings {[xi . . . x„_2|p] | 
p G {0, l}^}\{x} and we find y = z after a constant expected number of queries. This phase is 
recognized by the algorithm by the fact that tn(x) = n — 1. Note that we have tn(x) < n — 2 in 
the LinAlg phases — phases and 2 — and that we have tn(x) = n in phase 1. 

Summing up the expected number of queries needed for each phase, we have shown that Paul 
needs, on average, 

O(logn) + 0(n/ log n) + 0(n/ log n) + 0(1) = 0(n/ log n) 

until he has identified Carole's hidden code z. 

In the remainder of this proof we present the details of the first phase of Algorithm 7, the 
random sampling routine Sampling and the OptimizeBlock routine. As mentioned above, this 
description requires some technicalities. Therefore, we split it into the following parts: 

Part I In the first part, we present the general structure of the guesses Paul queries in the 
sampling phase. Here, we shall also show that the n positions are indeed sufficient to store, 
for any of the b blocks of length s all necessary information about the samples. 

Part II The second part, which is brief, provides further notation used in the pseudo-code 
of Algorithm 8. 

Part III The main part is the third one. Here we show how the contributions Aj(^.)(r) G [0..s] 
of the random samples r G can be computed. This also shows that indeed after sampling 
the t random guesses for the current block of interest, it is possible to regain the full query 
history using only the information that has been stored in the memory. This is clearly the 
most technical part of this proof. 

Part IV We conclude the description of phase 1 in the fourth part, where we explain how 
the memory is being updated, once the entries BLOCKj(^.)(z) of the secret code z in the i{x)-th 
block have been determined. 
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Part I The general structure of a random query x for determining block is the following. 

X = [^X]^|binary^^(eq(2;,y)) | opt(Si)| . . . |opt(Sj(^)) I I I I (1) 

(1) ' m ' ' {^) ' (4) (5) (6) 

binary^^^ (eq(z, |binary^^ (eq(z, x^)) 1 1 1 

V ' 

(7) 

binary,Jeq(2,ref«))|r«|A,(,)(r«)|l| . . . | 



(8) 

binary,Jeq(z,ref(*')))|r(*')|A,(,)(/)|l|0...0|binary,^(i(x))| 01 ], 

(8) continued (9) ^^^^ 

where we use 

(1) the first entry xi € {0, 1} to indicate whether we are sampling a new random substring 
[xi = 1) or whether we are doing a storage operation only through which we add to x all 
necessary information from the previous guess(xi = 0). An explanation of these operation 
follows below; 

(2) in entries for encoding the value eq(z,y) G [0..n] of the string y that is currently stored in 
the memory cell (serves as reference value), 

(3) {i{x) — l)s entries for the already determined blocks . . . , Bi(^x)-ii 

(4) s entries for the current block -Bj(2:) of interest. If we are sampling new information (i.e., if 
xi = 1), then the substring r is a string taken from uniformly at random and r is the 
all-zeros string of length s otherwise; 

(5) (6 — i(x))s zeros for the yet untouched blocks 3^/ with i{x) < b' < b, 

(6) £ entries for storing the length-^ prefix that coincides with Carole's hidden code (obtained 
trough phase 0), 

(7) 2£n + 1 entries for storing the values eq(z,x'^) and eq(z,x^) of the two reference strings x*^ 
and x^ (explanation follows), 

(8) t'{£n + s + £s + 1), t' < t, entries for storing, for each random sample, (i) the value eq(2:, ref ) 
for a reference string ref (in binary, requires £n positions), (ii) the random sample r G 
itself, (iii) its contribution Aj(^)(r) G [0..s] to Carole's reply (in binary, requires £s positions), 
and (iv) one additional "1" (to ease the computation of the number of guesses q{x) via pi(x); 
details follow), 

(9) £s entries for encoding in binary, which block we are currently trying to determine, and 
(10) the last two entries, suf f ix(x), for indicating the current phase of the algorithm. 



Clearly, one critical part is the limited storage capacity. For this reason, let us show that we 
have enough positions to store all the information needed to compute ^consistent ^ ^j^^ strings 
consistent with Carole's replies for the random guesses in the z(x)-th block -Bj(a:). 
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Recall that, by Theorem 3, for determining the i{x)-th. block BL0CKj(3,-)(z) of z, we need t = 
(2 + g) ^iog"^^\°g = ©(s/logn) random guesses (e > being an arbitrarily small constant). In 
addition, equivalently to the proof of Theorem 2, we need again 2 reference strings and 
(reference (7) in equation (1)). These two reference strings will be needed to infer the contributions 
Aj(a;)(r) of the random samples r G C in the i(x)-th block. 

From the structure of the guesses presented in equation (1) above, we infer that the total storage 
requirement can be bounded from above by 

1 + in+bs + l + 2ln + l + tiin + S + 4 + 1) + 4 + 2 

K 

= bs + ts -\- o{n/ log n) < n(l — ) + 0(n/ log n) + o(n/ log n) < n 

for sufficiently large, but constant K and sufficiently large n. This shows that, for sufficiently large 
n, Paul indeed can store all information needed to compute ^consistent single string of length 

n. 

Part II Let us now fix the notation used in the pseudo-code of routine Sampling (Algorithm 8). 
For all h' < i{x) we set 

opt(i?5') := . . . xij^b's] , 

the entries of x in the 6'-th block. The notation "opt" is justified by the fact that we shall have 
o^t{By) = BLOCK;,/ (z) for all b' < i{x). Furthermore, let 

AddRef erenceStringlnf o(x) := [0^_^| opt(i?i)| . . . |opt(i?j(^)„i) | 0^^^| 

(1),{2) ' ' (4) 

Xi+i(^^)s+i ■ ■ ■ X2e+bs \x2... x^|binary^^^(eq(2;, j;))|l | X2e+bs+2e„+2 ■ ■ ■ Xn] , 



(5), (6) (7) (*) 

where the references in the expression above are the same as the ones used in equation (1) and 
where (*) is simply a copy of the last entries of x. That is, the AddRef erenceStringInf o(j;) 
operation adds to x the values eq{z,x^) and eq(2:,x^) to the memory and each of these values is 
stored in binary notation of length i^- Lastly, we denote by Add(eq(z, x)) the operation 

Add(eq(2;,a;)) := [0^^| opt(Si)| . . . |opt(5i(^)_i) | | . ..Xp^(^^) \ (2) 

^^^'^^^ (3) (5),(6),(7),(8) 

X2 . . . 2;^|BL0CKi(^)(x)|binary^^(Ai(^)(BL0CKi(^)(x)))|l | 



(t) (*) 

which adds (in substring (f)) to the memory 

• a copy of the value eq(2;,ref) of a reference string ref (which was previously stored in 
positions {2, . . . , £}), 

• the random sample BL0CKj(^.)(2;) of the last guess, 

• the contribution Aj(2;)(BL0CKj(2;)(x)) of the random sample BLOCKj^^,) (x) to eq(z,x), and 

• the one additional "1" that shall ease the computation of q{x), the number of already queried 
samples. 
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All other but the first I entries (which are set to zero) are copied from x. The references in equation 
(2) are the same as in equation (1). 



Algorithm 8: The Sampling routine for k = 2 colors. 



1 Assumption: Memory M. = {(x, eq(z, x))} satisfies eq{z,x) < n, suffix(x) = [01], 
i{x) < b, and q{x) < t + 2; 

2 if q{x) = A Xi = then 
3 
4 



X ^ [l|binary^^(eq(2;,x))|opt(Si)| . . . |opt(5i(^)_i)|l . . . Ijx^^^ 

{x)s+l ■ ■ ■ ^n\] 

Query eq(z,x) and update by replacing (x, eq(z, x)); 



5 else if q{x) = A xi = 1 then 

X ^ AddRef erenceStringInf o(x) ; 

Query eq(z,x) and update ^A by replacing (x,eq(z,x)); 

8 else if 2 < q{x) <t + 2Axi = then 

X ^ [l|binary^^(eq(z,x))|opt(Si)| . . . |opt(5j(^.)_i)|r|x^+j(^)^+i . . . x„] for r € C chosen 
u.a.r.; 

Query eq(z,x) and update 7W by replacing (x,eq(z,x)); 
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11 else if 2 < ^(x) <t + 2Axi = l then 



12 
13 



X <— Add(eq(z, x)); 

Query eq(z,x) and update by replacing (x,eq(z,x)); 



Part III Let us now show in detail how to infer the contributions Aj(^)(r) of the random 
guesses. For clarity, we show how to do this for the first block, i.e., for the positions {£ + 1 . . .£ + s}. 
The procedure is similar for all other blocks and we shall comment on this case at the end of this 
part. 

First note that after the intermediate step in lines 7 to 9 of Algorithm 7, Paul enters the routine 
Sampling with A4 = {{x^ , eq{z , x^))} , where 

x° = [0 ... 0\zi...ze\ ... |binary^^(l) |01] , 

i+bs entries n-{2i+bs+es+2) entries entries 

and he queries in the first sampling iteration (lines 2-4 of Algorithm 8) 

x^ = [l|binary^^^(eq(z,x°))| 1...1 I ... \zi . . . Zi\ ... jbinary^^ (1)|01] 



l+^n=^ entries 



s entries {b-l)s entries n—{2i+bs+is+2) entries 



with the all-ones substring in the first block. We can compute the contribution of the first i entries 
[l|binary^^(eq(z, x'^))] to the value eq(z,x^) via 

/(x^) := eq([2;i . . . z^], [l|binary^^^(eq(2;, x°))]) = eq([xj+fe,+i . . .x^^+;,J, [x} . . .x^]) , 

and, by the same reasoning, the contribution of the first £ entries in x'^ to eq(z,x'') via 
f{x^) = eq([x°_,_jj,_,_j^ . . . Xg^^^^], [0 . . . 0]). Let us, for a moment, assume that we now had A4 = 
{(x^, eq(z, x^))} and that we had another string 

y = [zi...Zi\ ^ I \zi...ze\ |binary^^(l)|01] 

s entries (b— l)s entries n—{2£+bs+is+2) entries 



18 



for some random substring r G Then we could compute the contribution of the random entries 
r in the first block Bi of y via 

Ai(r) = eq(z y) - eq(^> + ^^(^^ + " />")) + " /(^')) " ^ 



Key to this equality is the fact that the first £ entries of y contribute i to Carole's response 
eq{z,y) to guess y, whereas the first £ entries of and contribute f{x^) + f{x^) to the sum 
eq(z, x*^) + eq(z, x^) and the fact that the entries in the first block, [0 ... 0] and [1 ... 1], respectively, 
contribute in total s towards eq(z, + eq(z, x^). All other entries Xi,yi, i > £ + s contribute either 
2 or to the sum eq{z,x^) + eq{z,x^) and every entry contributes 2 if and only if it contributes 1 
to the value eq{z,y). 

Note however, that we would now have to choose now which of the strings to keep in the memory 
and we would eventually loose the information eq(2:, x^). Therefore, in lines 6 and 7 in Algorithm 8, 
we first query the reference string 

= [ ^j^^^ I ■ ■ ■ Zi\ binary^^(eq(z, x''))|binary^^(eq(2;, | 

Gentries sentries (b-l)s ' 2£„+rentries ' 

0^ |binary^^(l)|01]. 

n- {2e + bs + 2t„ + 
l + e.s + 2) entries 

This query is needed only to store the values eq{z,x^) and eq(2;,x^) of both reference strings. 
Since adding the substring [binary^^(eq(2;, x'^))|binary^^(eq(z, to the memory string again 
changes the number of positions, in which the guess and Carole's hidden string coincide, we need 
to store this information in the next query as well. More precisely, we have that x^ and x^ differ 
in exactly the substring [binary^^^(eq(z, x°))|binary^^(eq(z, and the contribution of this 
substring (compared to the all-zeros substring which it replaces) is eq{z,x'^) — eq{z^x^). 

Furthermore, we need to indicate that we are sampling a new random substring. This is the 
first position in the string and the next £ — 1 entries are needed to encode in binary the value 
eq(2;,3;^). That is, instead of querying y as above we query (lines 9 and 10 in Algorithm 8) 

x^ = [l|binary^^(eq(z,x^))|r(^)|0 . . . Olzi . . . Z£|binary^^(eq(2;, x°))|binary^^(eq(z, a;^))|l| 
0...0|binary,^(l)|01], 

where the substring r^^^ G C in Bi is taken uniformly at random. The number of zeros in the 
first all-zeros substring is again (6 — l)s and in the second all-zeros substring it is n — {2£ + 
bs + 2£n + 1 + £s + 2). Now, in the same fashion as above, we can compute the contribution 
Ai(r(^)) = eq([2;f+i . . . of the substring € via 

Ai(r(i)) = eq{z,x') - [ ""^^'^ "^"^ + "^^^' + " /V)) + " /(^°)) " ' 

+ (eq(z, x2) - eq{z, x^)) - {£ - /V))) ■ (3) 

Also note that all the information needed for this computation is contained in the string x^ itself. 

Since later we want to be able to regain the full guessing history, in the next guess we store 
both the reference value eq(2:, x^) as well as the contribution Ai(A^'^). And, of course, we also 
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Algorithm 9: The OptimizeBlock routine 



1 Assumption: Memory M = {{x,eq{z,x))} satisfies eq{z,x) < n, suffix(x) = [01], 
i{x) < b, and q{x) = t + 2; 

2 if xi = then 

3 
4 
5 



y^[l... l|opt(Bi)| . . . |opt(5i(^)_i)|w;|3;^+i(^),+i . . . x„] for w G ^^^te^^t chosen u.a.r.; 
Query eq{z,y); 

if Ai^^){BLOCKi(^^){y)) = s then M ^ {(?/, eq(z, y))} ; //w = BLOCKi(^)(z) 



6 else 



X ^ Update (x); 

Query eq{z,x) and update ^A by replacing (x, eq(z, x)); //string prepared for 
determining the next block 



need to store the random guess A^'^ = BLOCKi(x^) itself. Therefore, we query (lines 12 and 13 in 
Algorithm 8) in the next iteration of Algorithm 7 

x'^ = [0^^^^ I 0_;_^^ I 0_;_^^l-2i • • • •Z£|t)inary£^(eq(z, x'^))|binary^^(eq(2:, x"^))|l| 

binary£^(eq(z,x^)) |BLDCKi(x^) |binary^^(Ai(r(^))) |1|0 . . . 0|binary^^(l)|01] . 

= [x|...x|] =r(i) see equation (3) 

Note that since Ai(r^^^) £ [0..s], we can encode this value using is positions only. 

By continuing like this we are able to compute, in any iteration of the first phase, the contri- 
butions Aj(^)(r) of the random substrings r G C. 

As in the proof of Theorem 2 we need to be able to compute how many random guesses have 
been queried already for the current block of interest. As indicated above, this can be derived 
from pi(x) as follows. For any random guess r G C we use in + s + ig + 1 entries for storing all 
information that will be needed later to regain the full guessing history. Furthermore, we used 
2in + 1 entries for storing the values eq(2;, x") and eq(z, x-^) of the two reference strings x*^ and x^, 
and we store information only in positions i > 2i + bs. Hence, the number of guesses for block -64(2:) 
can be computed as 

{0, if pi(x) < 2£ + 6s and xi = 

1, if pi{x) <2i + bs and xi = 1 

^^'-'^'^imSi^^ otherwise. 

After querying t random guesses (i.e., after querying a total number of t + fc guesses) for the 
first block, we regain the full guessing history from the string x currently in the memory as follows. 
The i-th random sample r^*) G C*, which we guessed for the first block is 

rW := [x2e+bs+2£„+l+{i-l){ln+s+ls+l)+in+l ■ • • ^2£+bs+2£„+l+(i-l)(£„+s+£,+l)+£„+s] ) 

and the corresponding query was 

y(') := [l\x2e+bs+2e„+i+{i~i){en+s+es+i)+i ■ • • ^2€+fes+2£„+i+{i-i){fn+s+4+i)+fnl 
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We have stored in binary the contribution Ai(r(*)) of A^^ to the overall function value eq{z,y^''^) 
in positions 

{2i+bs + 2en + l + {i-l){en + S+is + l)+en + S + l . . . 2£+6s + 24 + 1 + (i - 1) + + 1) + 

and thus we have 

Al(r(*)) = ^ 2''X2(,+hs + 2ln + l + {i-l){i„+S+is + l)+ln+S+ls-i- 

1=0 

By Theorem 3, the expected size of 

^consistent ._ |^ ^ 1}^ | Vi < t : eq(y,r(^)) = Ai(r«)} 

is bounded from above by 1 + 1/s. That is, we can now identify BLOCKi(z) in a constant number 
of guesses. These are lines 3 - 5 of routine OptimizeBlock (Algorithm 9). 

As mentioned above, determining the other blocks 2, . . . , 6 is similar. In these iterations, the 
{i{x) — l)s entries in positions {£+!,..., i+{i{x) — l)s} are already optimized, that is, they coincide 
with Carole's hidden string z. Thus, they are not changed in any further iteration of Algorithm 9. 

Part IV Once BLOCKj(^.) (z) = [z£^(^ii^j.)_i)s+i ■ ■ ■ Z£+i(x)s] has been determined, we need to update 
the memory such that we can start determining the entries of the next block. These are lines 7 and 
8 in Algorithm 9. Here we abbreviate 

Update(x) := [0. | opt(5i)| . . . |opt(Bj(^)) | 0. | x^+i^+i . . . X2£+bs \ 

(«) (c) (d) 

I binary^^ (i(x) + 1) | ^OlJ , 

ie) ^ ^) ' (9) 

where 

(a) the first i entries are set to zero, 

(b) we now have i{x) already determined blocks, 

(c) the new block of interest, block i{x) + 1, as well as all blocks b' > i{x) + 1 are (still) set to 
zero, 

(d) we keep the copy of the prefix [^i . . . zi] in positions {£ + bs + 1, . . . ,2i + bs}, 

(e) all information that we have used in the previous query to determine block -Bj(a;) is removed 
(and set to zero), 

(f ) the index for the current block of interest is increased by one, and 

(g) the last two entries still indicate the second phase. 

The case of A; > 3 colors. For the general case, the main strategy as given by Algorithm 7 
remains the same. What needs to be changed is the Sampling routine, where instead of sampling 
only two reference strings x^ and x^, we need to sample k reference strings with 
BLDCKi(^.)(j;^) = [c. . . c] for all c E [0..A; - 1]. 
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Algorithm 10: The Sampling routine for k > 3 colors. 



1 Assumption: Memory M = {{x,eq{z,x))} satisfies eq{z,x) < n, suffix(x) = [01], 

i{x) < b, and q{x) <t + k; 
1 if q{x) = A xi = then 

3 
4 



X ^ [l|binary^^(eq(z, x))\o^X.(Bx)\ ■ ■ ■ |opt(5i(^)_i)|l . . . l|a;£+j(^),+i 
Query eq(z,a;) and update M. by replacing (x, eq(2;, x)); 



5 else if q(x) = A xi = 1 then 

6 X *r- AddRef erenceStringInf o(x) ; 

7 Query eq(z,x) and update M. by replacing (x,eq(z,x)); 

8 else if 2 < q{x^ < k A xi = then 



9 
10 



X ^ [l|binary^^(eq(z,x))|opt(Bi)| . . . |opt(5i(^)_i)|g(x) . . . g(x)|x^+i(^.),+i . . . x„]; 
Query eq(z,x) and update by replacing (x,eq(z,x)); 



11 else if 2 < ^(x) < A; A xi = 1 then 

12 X -(r- AddRef erenceStringInf 02(x) ; 

13 Query eq(z,x) and update ^A by replacing (x,eq(z,x)); 

14 else ii k < q{x) <t + k A xi = then 



15 
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X ^ [l|binary^_^(eq(z,x))|opt(Bi)| . . . |opt(5i(3.)_i)|r|x^+j(^)s+i . . . x„] for r e C chosen 
u.a.r.; 

Query eq(z,x) and update ^A by replacing (x,eq(z,x)); 



17 else it k < q{x) <t + kAxi = \ then 



18 
19 



X ^ Add(eq(2;, x)); 

Query eq(z,x) and update M. by replacing (x, eq(2;, x)); 



Algorithm 10 shows the generalized sampling routine. Here we define 
AddRef erenceStringInf 02(x) : = 

[O^^l Opt(Bl)| . . . |opt(5i(^)_i) I O^^l X^+i(a.)s+i . . .Xpj(3.) I 
(^)'(^) (3) ("^^ (5), (6), (7), (7') 

X2 . . .x^|binary^^(eq(z,x))|l | Xp^(2.)+^,^+^+^^+2 • --Xn] , 

^ V ' V ' 

(t) (*) 

where 

(l)-(7) are the same references as in equation (1), 

(7') are the additional positions needed for storing the values eq(2:, x-') of the already queried 
reference strings x^, . . . , x'^^^^~^ (each requiring 2£„ + 1 positions), 

(t) we add the information of the g(x)-th reference string x'^^^'^ to the memory (again requiring 
2in + 1 positions), and 

(*) is simply a copy of the last entries of the previous guess. 
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The substring [x2 ■ ■ ■ X(\ is needed again to infer the contribution of the positions, in which we 
added the information of the previous reference string x'^^^^~^ . The reasoning is the same as in the 
case of A; = 2 colors. 

Since we added more reference string information, we need to adjust the definition of q{x) 
accordingly. Since we need 2^ + 1 additional bits for each reference string x^ , 2 < j < k, and we 
use in + s + is + 1 entries for storing the information of each random guess, we have 

'O, ifpi(a;) <2i + bs and xi = 
1, if pi{x) <2i + bs and xi = 1 
'~ I 3, if Pi{x) = 2i + bs + 24, + 1 + (j - 2)(24 + 1) and 2 < j < A; 
^ Pi(x)-(2Ws+(fc-i)(2^„+i)) otherwise 

It is easily verified that all statements made in the above proof for k = 2 colors remain cor- 
rect if we consider the general case of A; > 3 colors. Only the computation of the contribu- 
tions Aj(^)(BL0CKj(2,)(x)), equation (3), becomes a bit more tedious. However, all calculations are 
straightforward. We omit the details. □ 
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